Example-based Bi-directional C Translation with Semi-automatic

نویسندگان

  • K. C. Siu
  • Helen M. Meng
  • Hong Kong
  • C. C. Wong
چکیده

We have previously developed a framework for bi-directional English-to-Chinese/Chinese-to-English machine translation using semi-automatically induced grammars from unannotated corpora. The framework adopts an example-based machine translation (EBMT) approach. This work reports on three extensions to the framework. First, we investigate the comparative merits of three distance metrics (Kullback-Leibler, Manhattan-Norm and Gini Index) for agglomerative clustering in grammar induction. Second, we seek an automatic evaluation method that can also consider multiple translation outputs generated for a single input sentence based on the BLEU metric. Third, our previous investigation shows that Chinese-to-English translation has lower performance due to incorrect use of English inflectional forms – a consequence of random selection among translation alternatives. We present an improved selection strategy that leverages information from the example parse trees in our EBMT paradigm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Example-based bi-directional Chinese-English machine translation with semi-automatically induced grammars

We have previously developed a framework for bi-directional English-to-Chinese/Chineseto-English machine translation using semi-automatically induced grammars from unannotated corpora. The framework adopts an example-based machine translation (EBMT) approach. This work reports on three extensions to the framework. First, we investigate the comparative merits of three distance metrics (Kullback-...

متن کامل

An Experimental Multilingual Bi-directional Speech Translation System

We describe an experimental Multilingual Bi-directional speech translation system utilizing small, PC-based hardware with multi-modal user interface. Two major problems for people using an automatic speech translation device are speech recognition errors and language translation errors. We focus on developing techniques to overcome these problems. The techniques include a new language translati...

متن کامل

Bi-directional memory-based dialog translation: The KEMDT approach

Keywords: dialog translation, memory (example)-based translation, parallel marker passing, Korean language processing A bi-directional Korean/English dialog translation system is designed and implemented using the memory-based translation technique. The system KEMDT (Korean/English Memory-based Dialog Translation system) can perform Korean to English, and English to Korean translation using uni...

متن کامل

Bilingual Methods for Adaptive Training Data Selection for Machine Translation

In this paper, we propose a new data selection method which uses semi-supervised convolutional neural networks based on bitokens (Bi-SSCNNs) for training machine translation systems from a large bilingual corpus. In earlier work, we devised a data selection method based on semi-supervised convolutional neural networks (SSCNNs). The new method, Bi-SSCNN, is based on bitokens, which use bilingual...

متن کامل

A Novel Framework for Semi-automatic Video Object Segmentation

A novel framework for semi-automatic video object segmentation is proposed to facilitate user interaction and improve the performance of the system. The proposed framework scans the video sequence more than once, featured as multi-pass scan. In each pass, the sub-shots detected by a self-supervisor are processed under a novel bi-directional auto-tracking algorithm that depends on not only tempo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003